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Abstract 



We present a novel approximation algorithm for fc-median that achieves an approximation 
guarantee of 1 + v3 + e, improving upon the decade-old ratio of 3 + e. Our approach is based 
on two components, each of which, we believe, is of independent interest. 

First, we show that in order to give an a-approximation algorithm for fc-median, it is sufficient 
to give a pseudo-approximation algorithm that finds an a-approximate solution by opening 
k + 0(1) facilities. This is a rather surprising result as there exist instances for which opening 
k + 1 facilities may lead to a significant smaller cost than if only k facilities were opened. 

Second, we give such a pseudo-approximation algorithm with a — 1 + y3 + e. Prior to 
our work, it was not even known whether opening k + o(k) facilities would help improve the 
approximation ratio. 

1 Introduction 

Suppose you wish to select k polling stations for the US election so as to minimize the average 
distance each voter has to travel to his/her closest polling station. Then you need to solve the 
classic NP-hard fc-median problem that we shall design better approximation algorithms for in this 
paper. Formally, a A;-median instance I is defined by the tuple (k,J-,C,d), where k is the number 
of facilities allowed to be opened, J 7 is a set of potential facility locations, C is a set of clients, and 
d is a distance metric over J^UC. The goal is to open a set S C T of k facilities so as to minimize 
costx(S) = SieC^C?)^)) where d(j,S) denotes the distance from j to its nearest facility in S. 
When T = C = X, a solution S partitions the set of points into what is known as clusters and thus 
the objective measures how well X can be partitioned into k clusters. The /c-median problem has 
numerous applications, starting from clustering and data mining [3J to assigning efficient sources 
of supplies to minimize the transportation cost ( [Lil 120] ) . 

The difficulty of the fc-median problem lies in the hard constraint that only k facilities are 
allowed to be opened. Indeed, without such a constraint, we could simply open all facilities. 
Early approaches [HI [TTJ, [13] overcame this difficulty by giving pseudo-approximations that obtain 
better guarantees while violating the mentioned constraint by opening k + O(fc) facilities. The 
first constant factor approximation algorithm that opens k facilities is due to Charikar et al. [6]. 
Based on LP rounding, their algorithm produces a 6|-approximation. Several of the ideas in [6] 
are inspired from constant factor approximation algorithms obtained for the closely related metric 
uncapacitated facility location (UFL) problem. The UFL problem has similar input as A;-median 
but instead of giving an upper bound k on the number of facilities we can open, it specifies an 
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opening cost fi for each facility i £ T . The goal is to open a set of facilities S that minimizes the 
sum of the opening costs and connection costs, i.e., Ylies + costj(5). 

The connection between UFL and /c-median is motivated by basic economic theory: if we let the 
opening costs of facilities be small then a "good" solution to UFL will open many facilities whereas 
if we let the opening costs of facilities be large then a good solution will only open few facilities. 
By appropriately selecting the cost of facilities, one can therefore expect that an algorithm for 
UFL opens close to k facilities and therefore almost also gives a solution to the /c-median problem. 
This is the intuition of the concept of bi-point solutions that we define in Section |1.2| Jain and 
Vazirani first exploited this concept in a beautiful paper [12] to obtain a 6-approximation algorithm 
for /c-median using their 3-approximation primal-dual algorithm for UFL. The factor 3 was later 
improved by Jain et al. [H] to 2 resulting in a 4-approximation algorithm for /c-median. 

In spite of the apparent similarities between UFL and /c-median, current techniques give a 
considerable better understanding of the approximability of UFL. For UFL and its variants, there 
has indeed been a steady stream of papers giving improved algorithms [T7J [22j [T21 [7J [131 13 EH Ell 
\19\ H] . The current best approximation algorithm is due to Li |16] . He combined an algorithm by 
Byrka [1] and an algorithm by Jain et al [11] to achieve an approximation guarantee of 1.488. This 
is close to being best possible, as it is hard to approximate UFL within a factor of 1.463 [9]. In 
contrast there has been less progress for /c-median and the approximability gap is larger. The best 
known approximation algorithm is the local search algorithm given by Arya et al. [2]. They showed 
that if there is a solution T' , where any p swaps of the open facilities cannot improve the solution, 
then T 1 is a 3 + 2/p approximation. This leads to a 3 + e approximation that runs in time n 2 / e . 
On the negative side, Jain et al. |llj proved that the /c-median problem is hard to approximate 
within a factor 1 + 2/e ~ 1.736. Moreover, the natural linear programming relaxation of A;- median 
is known to have an integrality gap of at least 2. The best upper bound is by Archer et al. [1], who 
showed that the integrality gap is at most 3 by giving an exponential time rounding algorithm that 
requires to solve the maximum independent set problem. 

As alluded to above, the main difficulty of the /c-median problem is the hard constraint that 
we can open at most k facilities. In this paper we take a different approach that allows us to relax 
this constraint and thereby addressing the problem from a novel point of view using what we call 
a pseudo-approximation algorithm. This leads to the improved approximation algorithm breaking 
the barrier of 3 that we discuss next. 

1.1 Our Results 

Our improved approximation algorithm can be stated as follows. 

Theorem 1. There is an algorithm which, given a k-raedian instance X and a number e > 0, 
produces a 1 + \/3 + e-approximate solution to X in running time O (^n ^ 1 ^ 2 ^ . 

Our algorithm contains two main components, each of which, we believe, is of independent 
interest. First, we show that in order to give an approximation algorithm for /c-median, it suffices 
to give a pseudo- approximation algorithm A which, given a /c-median instance I, outputs a set 
S C T of k + c facilities with costj(5) < aoptj, where optj is the cost of optimum solution for 
X. Given such an algorithm A as a black box, we can design an a + e-approximation algorithm A' 
whose running time is n°' c / £ ' times that of A. Interestingly, the instance (see Figure [IJ that gives 
the integrality gap of 2 for the natural LP relaxation of /c-median vanishes if we allow the integral 
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solution to open k + 1 facilities. This suggests that our reduction may bring in new avenues for 
approximating /c-median. In particular, we find the following open problem interesting: given a 
/c-median instance X, what is the maximum ratio between the cost of the optimum integral solution 
of X with k + 1 open facilities, and the LP value (with k open facilities)? 

To complement the first component, we give the aforementioned pseudo- approximation algo- 
rithm A with a = 1 + \/3 + e. Prior to our work, it was not even known whether opening k + o{k) 
facilities would help improve the approximation ratio; all known pseudo-approximation algorithms 
require k + S7(/c) open facilities. In contrast, our algorithm only opens k + 0(1 /e) facilities. The 
algorithm A contains 2 steps. We obtain a bi-point solution for /c-median using the algorithm of 
We lose a factor of 2 in this step. Then, we convert the bi-point solution into an integral 
solution with k + 0(l/e) open facilities, losing another factor of in the approximation ratio. 

We remark that if we had insisted on opening k facilities, then a factor of 2 has to be lost in the 
last step as the instance achieving an integrality gap of 2 has a bi-point solution. 

Theorem [I] does not give a better upper bound on the integrality gap of the natural LP due 
to the following reason: instead of running the pseudo-approximation algorithm A on the input 
instance X, we run it on a residual instance X' obtained from X by removing a subset of facilities 
that the optimal solution does not open. The way we obtain X' is to guess 0(1/ e 2 ) "events" and let 
X' be the instance conditioned on these events. Due to this nature, our algorithm can be converted 
to a rounding algorithm based on solving an 0(l/e 2 )-level LP in the Sherali- Adams hierarchy. 
Instead of guessing the 0(l/e 2 ) events, we can now find these events explicitly by looking at the 
LP solution. Conditioning on these events, we obtain a fractional solution of the basic LP. By 
rounding this LP, we obtain a 1 + \/3 + e-approximate solution. Thus, our approach can be seen 
to give an 1 + \/3 + e-upper bound on the integrality gap of the 0(l/e 2 )-level LP in the Sherali- 
Adams hierarchy. Our result was in fact first obtained by studying the power of the Sherali- Adams 
hierarchy for the /c-median problem. However, as it can also be obtained using a combinatorial 
approach with less cumbersome notation, we have chosen to present that approach. 

1.2 Preliminaries 

Given a /c-median instance I = ( k, J 7 , C, d) , a pseudo-solution to X is a set S C T . A pseudo-solution 
S satisfying |<S| < k is a solution to I; a pseudo-solution S with |<S| < k + c, for some number 
c > 0, is called a c-additive (pseudo-) solution. The cost of a pseudo-solution S to X is defined as 
costx(5) = J2jeC "-*)' wnere d(j, S) denotes the distance from j to its closest facility in S. We let 
OPTj denote an optimal solution to X, i.e., one of minimum cost, and we let optj = costj(OPTj). 
To avoid confusion we will throughout the paper assume that the optimal solution is unique and 
that the concept of closest facility (or client) is also uniquely defined. This can be achieved either 
by slightly perturbing the metric or by simply breaking ties in an arbitrary but fixed way. 

When considering a client or facility, it shall be convenient to argue about close clients or 
facilities. For any p £ JUC and r > 0, we therefore define FBallx(p, r) = {i G J: d(p,i) < r} and 
CBallj(p, r) = {j £ C : d(p,j) < r} to be the set of facilities and clients within distance less than 
r from p, respectively. When X is clear from the context, we omit the subscripts in costj, OPTj, 
optj, FBallj, and CBallj. 

The standard linear programming relaxation for the fc-median problem is formulated as follows. 
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Figure 1: Instance that gives integrality gap 2/(1 + 1/k) and the optimal fractional solution. We 
have k + 2 facilities and k + 1 clients co-located with the top k + 1 facilities. All edges in the graph 
have length 1. The optimal integral solution has cost 2, while the optimal fractional solution has 
cost (k + 1) • + I • 1) = 1 + 1/k. 



minimize Y,ieF,jeC d (hj)xij 

subject to Yji&TVi < k ( la ) 

Eie^^ii = 1 JSC (lb) 

Xij <yi i £ eC (lc) 

Jcy,!/i 6 [0,1] i£T,j£C (Id) 



Constraint (la) says that we are allowed to open at most k facilities, Constraint (lb) says that we 



must connect each client, and Constraint (lc) says that if we connect a client to a facility then that 
facility has to be opened. 

As mentioned earlier, the above linear programming has an integrality gap of 2, even when the 
underlying metric is a tree. The instance that gives the integrality gap of 2 is depicted in Figure [TJ 
It is a star with fc + 1 leaves. The center of the star is a facility and the leaves are both facilities and 
clients. Note that a pseudo-solution that opens all leaves, i.e., + 1 facilities, has cost whereas 
any solution that opens only k facilities has cost 2. The solution to the linear program obtained 
by a linear combination of the pseudo-solution that opens all leaves and the solution that only 
opens the center of the star has cost 1 + 1/k yielding the integrality gap of 2 when k tends to 
infinity. In general, a solution that is a linear combination of two pseudo-solutions is called a bi- 
point (fractional) solution. As this concept is important for our pseudo-approximation algorithm, 
we state its formal definition. 

Definition 2 (bi-point (fractional) solution). Let I = (k,J-,C,d) be a k-median instance. Let Si 
and 1S2 be two pseudo-solutions to I such that \S\\ < k < j^l- Let a > 0, b > be the real numbers 
such that a + b = 1 and a \S\ \ + b I1S2I = k. Then, the following fractional solution to I, denoted by 
aS\ + bS2, is called a bi-point (fractional) solution: 

1. yi = al ie si + bl ieS2 ; 

\st(i,Si,j) + ^ldst(i,S2j')' w ^ ere c ' s t(^'5,j) denotes the event thati is the closest facility 

in S to j . 
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It is easy to see that the cost of the fractional solution aS\+bS2 is exactly acostx(5i)+6costj(52). 
Jain and Vazirani |12j gave a Lagrangian multiplier preserving 3-approximation for UFL, which 
immediately yields an algorithm which produces a bi-point solution whose cost is at most 3 times 
the optimum. Together with an algorithm which converts a bi-point solution to an integral solution 
at the cost of a factor 2, [12] gave a 6-approximation for fc-median. Later, the factor 3 was improved 
by Jain et al. to 2. We now formally state the result of 

Theorem 3 ([H]). Given a k-median instance I, we can find in polynomial time a bi-point solution 
aS\ + bS2 to X whose cost is at most 2 times the cost of an optimal solution to X. 

1.3 Overview of the Algorithm 

The two components of our algorithm are formally stated in Theorem [4] and Theorem [5j whose 
proofs will be given in Sections [2] and [3j respectively. Together they immediately imply Theorem [l} 

Theorem 4. Let A be a c-additive a- approximation algorithm for k-median, for some a > 1. 
Then, for every e > there is a a + e- approximation algorithm A' for k-median whose running 
time is O (n ^/^) times the running time of A. 

Theorem 5. There exists a polynomial time algorithm which, given a k-median instance X = 
(k,J-,C, d) and e > 0, produces an 0(l/e)- additive 1 + \/3 + e-approximate solution to X. 

We now provide more details about the proof of the two theorems. At first glance, it seems that 
the transformation from a pseudo-approximation to a real approximation stated in Theorem [4] is 
impossible, since there are cases where allowing k + 1 open facilities would give much smaller cost 
than only allowing k open facilities. However, we show that we can pre-process the input instance 
so as to avoid these problematic instances. Roughly speaking, we say that a facility i is dense if 
the clients in a small ball around i contribute a lot to the cost of the optimum solution OPT (see 
Definition [6]) . We guess the 0(1/ e) densest facilities and their respective nearest open facilities 
in OPT. Then for each such dense facility i whose nearest open facility in OPT is i! , we remove 
all facilities that are closer to i than i! (including the dense facility i). Then we get a residual 
instance in which the gap between the costs of opening k + 0(1) and k facilities is small. The 
pseudo-approximation algorithm is then applied to this residual instance. 

For example, consider the integrality gap instance depicted in Figure [1] and let OPT be the 
optimal solution that opens the center and k — 1 leaves. Then the two leaves that were not opened 
contribute a large fraction of the total cost (each contributes opt/2 to be precise) and the two 
corresponding facilities are dense. By removing these dense facilities in a preprocessing step, the 
gap between the costs of opening k + 0(l) facilities and k facilities for the residual instance becomes 
small (actually in this example). 

Regarding the proof of Theorem [5j we first use Theorem [3] to obtain a bi-point solution for k- 
median whose cost is at most twice the optimum cost. Jain and Vazirani [12j showed how to convert 
a bi-point solution to an integral solution, losing a multiplicative factor of 2 in the approximation. 
As we previously mentioned, this factor of 2 is tight, as the fractional solution for the gap instance 
in Figure [I] is a bi-point solution. Thus, this approach can only yield a 4-approximation. 

This is where the c-additive pseudo-approximation is used and again the integrality gap instance 
depicted in Figure [T] inspired our approach. Recall that if we open the k + 1 leaves of that instance, 
then we get a solution of cost 0. In other words, by opening 1 additional facility, we can do better 
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than the fractional solution. One may argue that this trick is too weak to handle more sophisticated 
cases and try to enhance the gap instance. A natural way to enhance it is to make many separate 
copies of the instance to obtain several "stars" . One might expect that the fractional cost in each 
copy is 1, the integral cost in each copy is 2 and opening 1 more facility can only improve the 
integral solution of one copy and thus does not improve the overall ratio by too much. However, 
the integral solution can do much better since one cannot restrict the integral solution to open 
k facilities in each star. As an example, consider the case where we have 2 copies. The integral 
solution can open k — 1 facilities in the first star, and k + 1 facility in the second star. Then, the 
cost of this solution is 3, as opposed to 4 achieved by opening k facilities in each star. The gap is 
already reduced to 1.5, without opening additional facilities. Thus, this simple way to enhance the 
instance failed. 

Our pseudo-approximation algorithm is based on this intuition. From the bi-point solution 
aJ-\ + bJ-2, we obtain copies of "stars" (similar to the integrality gap instance). Then for each 
star we (basically) open either its center with probability a or all its leaves with probability b. 
Note that since either the center or all leaves of a star is open we have that a client always has 
a "close" facility opened. With this intuition we prove in Section [3] that the expected cost of the 
obtained pseudo-solution is at most 1+ ^ +e times the cost of the bi-fractional solution if we open 
0(1/ e) additional facilities. The 0(1/ e) additional facilities (and the case distinction in Section [3]) 
comes from the difficulty of handling stars of different sizes. If all stars are of the same size the 
pseudo-approximation algorithm becomes easier (run the algorithm in Section 3.2 with one group 
of stars) and one obtains a 1+ 2 - approximate solution that opens at most k + 3 facilities. 



2 Obtain solutions from additive pseudo-solutions 

In this section, we prove Theorem |4j As we mentioned earlier, there are instances where pseudo- 
solutions opening k + 1 facilities may have much smaller cost than solutions opening k facilities. A 
key concept to overcome this issue is the notion of sparse instances: 

Definition 6. For A > 0, an instance I = (k,J-,C,d) is A-sparse if for each facility i E T ', 

(1 - £)d(i, OPTx) • |CBallx(«, £d(i, OPTx))| < A, (2) 
where £ := 1/3. We shall also say that a facility i is A-dense if it violates 

Recall that d(i, OPTj) is the distance from i to its nearest facility in OPTj. 

The idea of the above definition is to avoid instances where we can significantly reduce the cost 
by opening 0(1) additional facilities. Consider the gap instance X in Figure [I] and suppose OPTj 
opens the center and the first k — 1 leaf- facilities. Then I is not A-sparse for A < optj/2 since the 
last two leaf- facilities are ^4-dense. 

The usefulness of the definition is twofold. On the one hand, we show that we can concen- 
trate on very sparse instances without loss of generality. On the other hand, we show that any 
c-additive pseudo-solution to a sparse instance can be turned into a solution that opens k fa- 
cilities by only increasing the cost slightly. The intuition behind the result that we can only 
concentrate on sparse instances is the following. Consider an instance I that is not optj/i- 
sparse for some constant t. If we consider a facility i that is optj/t-dense then the connection 
cost of the clients contained in CBall(i, £d(i, OPTj)) in the optimal solution OPTj is at least 
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(1 — £)d(i, OPTx)|CBall(i, £d(i, OPTx))| > optj/t. So, there can essentially (assuming disjoint- 
edness of the balls of clients) only be a constant t number of facilities that violate the sparsity 
condition. We can guess this set of dense facilities, as well as their nearest facility in OPTj in time 



This is the intuition of Algorithm [T] (that tries to guess and remove opt/f-dense facilities) and 



the proof of the following lemma which is given in Section 2.1 



Lemma 7. Given a k-median instance X = (k, J 7 , C, d) and a positive integer t, Algorithm^ outputs 
in time many k-median instances obtained by removing facilities from X so that at least one, 

say X' = (k, T' C J 7 , C, d), satisfies 

§7jfi): the optimal solution OPTj to X is also an optimal solution to X' ; and 

^p): X' is opt x /t- s P arse - 

Note that X' is obtained by removing facilities from X. Therefore any solution to X' defines 
a solution to X of the same cost and we can thus restrict our attention to sparse instances. The 
next lemma shows the advantage of considering such instances. Assume we now have a c-additive 
solution T to a sparse instance X. Algorithm [2] tries first in Lines [2]j3] to identify facilities in T 
whose removal does not increase the cost by too much. If the removal results in a set of at most k 
facilities, we have obtained a "good" solution returned at Step [4] of the algorithm. Otherwise, as 
we prove in Section 2.2 using sparsity, more than k — t of the facilities of the solution T are very 
close to facilities in OPTj. Algorithm [2] therefore tries to guess these facilities (the set T>) and the 
remaining facilities of OPTj (the set V). The obtained bounds are given in the following lemma. 

Lemma 8. Given an A-sparse instance X = (k,J r ,C,d), a c-additive pseudo-solution T, 6 G 
(0, 1/8), and an integer t > 2c/(6^), Algorithm]^ finds in time set S C T such that: 

$^fi): S is a solution to X, i.e, \S\ < k; and 

costx(S) < max |cost x (T) + cB, ^§ ■ opt x }, where B := 2 • ^ +cos g (T)A . 

Before giving the proofs of Lemmas [7] and [8] let us see how they imply the main result of this 
section. 



Proof of Theorem g Select the largest 5 G (0,1/8) such that (1 + 36) /(l - 36) < a and 
t := * • = 0(c/e). Given a fc-median instance X, use Algorithm jlj to obtain a set of /c-median 
instances such that at least one of these instances, say X', satisfies the properties of Lemma [7| 
In particular, X' is optj/i-sparse. Now use algorithm A to obtain c-additive pseudo-solutions to 
each of these instances. Note that when we apply A to X' , we obtain a solution T such that 
costj(7") = costx'(T) < a ■ opt x , = a ■ opt x . Finally, use Algorithm [2] (with the same t and 6 
selected as above) to transform the pseudo-solutions into real solutions and return the solution to 
X of minimum cost. The cost of the returned solution is at most the cost of S where S is the 
solution obtained by transforming T. By Lemmas [7] and [8j we have that costj(S') = costx'(S) is at 
most 

/ , optj + costx(T) 1 + 3,5 
max jcostz(T) + c • 2 — , - _ ^ optj 

which in turn, by the selection of 6, £, and t, is at most aoptj + c • Aa °^ 1 < (a + e)optj. 



We conclude the proof of Theorem [4] by observing that the runtime of the algorithm is 
°( c / e ) times the runtime of A. 



n 



0(t) 
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Input: a £;-median instance X = (k, J 7 , C,d) and a positive integer t 

Output: a set of A;-median instances so that at least one satisfies the properties of Lemma [7] 
for all t' < t facility-pairs (12,12), ■ ■ ■ , (h',^') ou tput (k, J 7 ', C, d), where 

J- 1 = J- \ _.. FBall(i z , d(i z , i' z )) f> the facilities that are closer to i z than i' z is to i z are removed 

Algorithm 1: Enumeration of fc-median instances. 

2.1 Proof of Lemma [7} obtaining a sparse instance 

First note that Algorithm [l] selects facility-pairs and can be implemented to run in time 
■nP®. We proceed by showing that for one selection of facility-pairs the obtained instance satis- 
fies the properties of Lemma [7j Consider a maximal-length sequence (12,^2), ■■■ > {Hi^'g) °f 
facility-pairs satisfying: for every b = 1, . . . ,£, 

• it € J r \ Uz=i FBall(^, d{i z , i' z )) is an optj/t-dense facility; and 

• i' b is the closest facility to % in OPTj. 

Note that the instance X' := (k, J 7 ', C, d) with T' = T \ \} z= \ FBall(i z , d(i z , i' z )) is optj/i-sparse 
since otherwise the sequence (12,^2), ■ ■ ■ , (h, i'g) would not be of maximal length. Moreover, 

since we do not remove any facilities in OPTj, i.e., (J- \ J-') n OPTj = 0, we have that OPTx is 
also an optimal solution to X' . In other words, X' satisfies the properties of Lemma [7J 

We complete the proof by showing that Algorithm [l] enumerates X' , i.e., that t < t. For the sake 
of notation let B z := CBall(i, £d(i z , i' z )). First, note that the client-balls 81,82, ■ ■ ■ ,Bg are disjoint. 
Indeed, if a ball 8 Z overlaps a ball 8 W with 1 < z < w < I then d(i z ,i w ) < £d(i z ,i' z ) + £d(i w ,i' w ). 
However, since i w must be in T \ FBa\\(i z ,d(i z ,i' z )), we have d(i z ,i w ) > d(i z ,i' z ). Since i' w is the 
closest facility in OPTj to i w , we have d(i w ,i' w ) < d(i w ,i' z ), which, by triangle inequalities, is at 
most d(i z ,i w ) + d(i z ,i' z ) < 2d(i z ,i w ). Hence (using that £ = 1/3), 

£(d(i z ,i' z ) + d(i )) < 3£d(i z ,i w ) < d(i z ,i w ), 

which implies that the balls do not overlap. 

Second, note that the connection cost of a client in 8 Z is, by triangle inequalities, at least (1 — 
£)d(i z ,i' z ) = (1 — £)d(i z , OPTx)- We thus have (using that the client-balls are disjoint) that opt-j > 
X) z =i(l — £)d(i z ,OPTz)\B z \- As we only selected optj/t-dense facilities, (1 — £)d(i z , 0PJx)\8 z \ > 
optj/t and hence optj > ^optj/t. It follows that t > t which completes the proof of Lemma [7j 

2.2 Proof of Lemma [8j obtain solution to sparse instance from pseudo-solution 

We start by analyzing the running time of Algorithm [2] Clearly the while loop can run at most 
c iterations (a constant). The number of different pairs (T>, V) in the for loop is at most 




Notice that \T'\ <k + c and c < t. For sufficiently large k and \ J-\, the above quantity is at most 
('"T') X^=o (c+l) = ■ Algorithm [2] can thus be implemented to run in time as required. 

Moreover, it is clear from its definition that it always returns a solution S, i.e., \S\ < k. 

8 



Input: an ^4-sparse instance X = (k, J 7 , C, d), a c-additive pseudo-solution 7~, an integer t > c and 
5 G (0,1/8) 

Output: A solution S satisfying the properties of Lemma [8] 
1: V := T and B := 2 • A+cos g (T)/t 

2: while \T'\ > k and there is a facility i G T' such that costj(T"' \ {i}) < costj(7" / ) + B do 
3: Remove % from T'; 

4: return 5 := T if \T'\ < k; 

5: for all D C f and V C J such that \V\ + |V| = k and |V| < t do 

6: For i £ T>, let Lj = d(i,T' \ {i}) and fi be the facility in FBall(i, 8Li) that minimizes 

^ mm{d(fi,j),d(j,V)} 

j&CBa\\{i,Li/3) 

7: Let 5d, v := V U {/* : i G £>} 

8: return 5 := arg min^ v costx(5x) j v) 

Algorithm 2: Obtaining a solution from a c-additive pseudo-solution. 

We proceed by proving that S satisfies ^p) of Lemma |8j Suppose first that the algorithm 
returns at Line|4j By the condition of the while loop from Line [2] to [3j we increase costj(7"') by 
at most B each time we remove an element from T' ■ We remove at most c elements and thus we 
increase the total cost by at most cB. It follows that (|8b) is immediately satisfied in this case. 

From now on suppose instead that we reached Line [5] of Algorithm [2] and thus \T'\ > k. We 
shall exhibit sets Vq and Vo such that \Vq\ + |Vo| = k, |Vo| < t and cost(5x) 0i v ) — i^H°Pti- As 
Algorithm [2] selects T>q and Vo in one iteration and it returns the minimum cost solution, this 
concludes the proof of Lemma [8j In order to define the sets T>o and Vo it shall be convenient to use 
the following definitions. 

Definition 9. For every facility i G T 1 , let Li = d(i,T' \ {i}) be the distance from i to its nearest 
neighbor in T' , and let ti = d{i, OPTj) be the minimum distance from i to any facility in OPTx- 
For a facility i G T' , we say i is determined if li < 5Li. Otherwise, we say i is undetermined. 

The sets T>o and Vo are now defined as follows. Set T>q contain all facilities in i G T' that 
are determined. If we let /* for i G T>o be the facility in OPTj that is closest to i, then set 
Vo := OPTj \ {/* : i G Vq}. The intuition of Vq and Vo is that the solution Sd ,v 1S very close to 
OPTj: the only difference is the selection of fi at Line [6] of Algorithm [2] instead of /*. Since each 
i G T>q is determined, selecting fi greedily using a "locally" optimal strategy gives a good solution. 

We first show that sets Vq and Vo are indeed selected by Algorithm [2] and then we conclude the 
proof of the lemma by bounding the cost of 5x> 0j v - 

Claim 10. |X> | + |V | = k and |V | < t. 

Proof of Claim. We start by proving that \V \ + |V | = k. Recall that V = OPT x \ {/* : i G Pol- 
It is not hard to see that /* ^ f*, for two distinct facilities in Dq. This is indeed true since d(i, i') > 
max(Li,Li>),d(i,f*) < 5Li,d{i',f*) < bL v and 5 < 1/8. Thus, f*(V ) := {/* : i G T>q} has size 
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Figure 2: Definitions of Vq, Vo and Uq. Dashed and empty squares represent facilities in OPTr and 
T' respectively. Vq is the set of empty squares circles. A dashed circle represents FBall(z, 5 Li) for 
a determined facility i £ Vq. Thus, /* is in the ball since £{ < 5Li. Uq(Vo, resp.) is the sets of 
empty (dashed, resp.) squares that are not inside any circle. A solid circle for i G T>q represents 
the "care-set" of i. 

| Pol) which in turn implies that (to simplify calculations we assume w.l.o.g. that |OPTj| = A;) 

|Vo| = |OPTi|-|2? | = fc-po|- 

We proceed by proving |Vo| < t. Note that the sets of determined and undetermined facilities 
partition T' ■ Therefore, if we let IAq be the set of undetermined facilities, we have that |X>o| 
7 ' \Uq\. Combining this with the above expression for |Vo| gives us 

I Vol =k-\r\ + \u \ < \U \. 

We complete the proof of the claim by showing that \Uq\ < t. 

By the assumption that we reached Line[5]of Algorithm[2j we have \T'\ > k and costx(T'\{i}) > 
costj(T') + B for every i G T' ■ Assume towards contradiction that |Wo| > t. For every i G T', let 
Ci be the set of clients in C connected to i in the solution T' and Cj be the total connection cost of 
these clients. Thus, costj(T') = X^ieT' ^i- Take the facility i G Uq with the minimum C%. Then, 
we have Cj < costz (T')/t. Let i' be the nearest neighbor of i in T '; thus d(i, i') = Li. 

We shall remove the facility i from T' and connect the clients in Ci to i'. In order to consider 
incremental connection cost incurred by the operation, we divide Ci into two parts. 

Ci n CBall(i, <5£Lj). Since i is undetermined, we have 6 Li < and CBall (z, 8£Li) C CBall(z, ££j). As 
I is an ^-sparse instance, % is not an ^4-dense facility. That is (1 — £) |CBall(i, £% < A, 
implying 

(1 + 50 \d n CBall(t, 5iLi)\Li < ^^A < A/(5(). 

Then, as each client in Cj n CBall(«, 5£Li) has distance at most (1 + 5£)Li to i' (by triangle 
inequalities), connecting all clients in d n CBall(z, S£Li) to i! can cost at most A/{5£). 

Ci \ CBall(i, 6£Li). Consider any client j in this set. Since d(j,i') < d(j,i) + L» and d(j,i) > 5£,L; L , 
we have f^'^ < j^fj: = 1/(^0- Hence, the connection cost of a single client is increased 

by at most a factor Therefore, the total connection cost increases by at most Ci/(<5£), 

which by the selection of i is at most costz(T')/(8£t). 
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Summing up the two quantities, removing i from T' can only increase the connection cost by 
at most A+cost i(7' lll m As the while loop of Algorithm 2 ran for less than c iterations, costj(T') < 

costx(T) + cB. Therefore, A+cos g (r ' )/f < A+(costl g^+ cB)/t which since t > 2c/ (5£) is at most 
^+ cos *iC7~)/ f -\- B /2 = B leading to a contradiction. Hence, \Uq\ < t which concludes the proof of 
the claim. □ 

Having proved that the instance Sv ,Vo 1S selected by Algorithm [2j we conclude the proof of 
Lemma [8] by bounding the cost of Sx> 0) v - 

Recall that, for every i G T>q, Algorithm [2] opens one facility /, in the ball FBall(i, 8Li). We 
know we can do this so that the connection cost of C is optj. We show that we can approximate 
this instance within a factor of 1 + 0(5). Roughly speaking, if a client is far away from any of these 
balls, then it does not care which facilities to open inside the balls, up to a factor 1 + 0(5). If a 
client is close to one of these balls, say FBall <5Lj), then we put the client into the "care-set" of i. 
For each i, we open a facility in the ball that is best for its care-set. 

To be more specific, let the care-set of i be CBall(i, Li/3) for any i G Dq. Clearly, the balls 
CBall(i, Li/3), i G Vq are disjoint. As stated in Line [6] of Algorithm [2j we open a facility fi in 
FBall(i, 5 Li) that minimizes 

^2 nun{d(/i,j'),d(j',Vo)}. 

jeCBall(j,Li/3) 

Claim 11. costx(Sx>o,v ) < i^ffoptr- 

Proof of Claim. We compare OPTj and 5x> 0i v - Consider a client j G CBall(i, Li/3) for some i G Vq. 
The distance from j to any facility in FBall(i, 5 Li) is at most (1/3 + 5)Li. For any distinct facility 
i' G T>o, the distance from j to any facility in FBall(i', SLy) is at least d(i,i') — Li/3 — 5L^ > 
d(i,i') - d(i,i')/3 - Sd(i,i') = (2/3 - 6)d(i,i') > (2/3 - 5)L { . For 5 < 1/8, 1/3 + 5 < 2/3 - 5. 
Thus, j is either connected to /* or some facility in Vo in the solution OPTj. Noticing that we 
are selecting the best fi for every i G Vq, the total connection cost of Uiex> CBall(i, Lj/3) in the 
solution 5d 0j v is at most that in OPTj. 

Now, consider a client j that is not in U«ex> CBall(z, Lj/3). If it is connected to some facility 
in Vo in the solution OPTj, then the connection cost of j in the solution <Sd Oi v can not be larger, 
since Vo C S. Assume j is connected to /* G CBall(«, Li/3) for some i G Vq. We compare d(j,f*) 
to d(j, fi): 

d(j,fj) < d(j,i) + 5L l < Lj / 3 + 5 Lj = 1 + 35 
d(j, ft) ~ d(j, i) - SLi ~ Li/3 - 5Li 1-35' 

Thus, Sv ,v has connection cost at most ^Mj^optj. □ 



3 An 0(l/e)-additive 1 + y3 + e approximation for /c-median 

This section is dedicated to prove Theorem [5j Given a A:-median instance X = (k, J 7 , C,d), we first 
use Theorem [3] to obtain a bi-point solution aS± + &1S2 whose cost is at most 2 times the optimum 
cost of X. Then it suffices to convert o5i + &1S2 into an 0(l/e)-additive solution, whose cost is at 
most times that of aSi + bS2- 
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By the definition of bi-point solutions, we have a + b = 1, < k < jj^l and a\J-\\ + 6|J-2| = k. 
It shall be convenient to think of aJ-\ + bJ-2 as a bipartite graph (see Figure [3]) with vertex sets 
T\ and T2 and an edge for each client j G C that is incident to its closest facilities in T\ and Ti 
denoted by h(j) and £2(7)1 respectively. Moreover, let d\{j) := d(j,ii(j)) and efeC?) := d(j,i2(j))- 
Then, the (fractional) connection cost of j in the bi-point solution is ad\{j) + bd-i^j)- Similarly, if 
we let d\ := cost(J r i) = YljeC and ^2 := cost(J 7 2) = X^ec ^(j) then the bi-point solution has 
cost ad\ + bd-2,- 

We shall prove Theorem[5]by exhibiting different algorithms based on the value of a. Specifically, 
we shall distinguish between the cases when a is in ( 



^3-1 
4 



4 



1+V3 



and 



1+V3' 



The 



simplest case is when a £ 



1 



< 1/a = (1 + \/3)/2 approximation. 



the solution where we open all facilities in T\ is then a 



ad\+bd,2 

For the two remaining cases, we will use the concept of stars. For each facility i S J2 define 
7r(i) to be the facility in T\ that is closest to i. For a facility i £ T\ , let Si = {i' £ T2 ■ tt(*') = 
We think of Si as a star with center i and leaves Si. Note that by the definition of stars, we 
have that any client j with 12(3) £ Si has d(i2(j),i) < d(i2(j),h(j)) = G?2(j) + di(j) and therefore 
rf(j) i) < d(j, i2{j)) + d(i2(j),i) < 2<i2 + di- Our algorithms will ensure that there is an open facility 
"close" to every client by always opening i if not all facilities in Si are opened. The strategy for 
either opening the c enter of a star or its leaves (or sometimes both) depends on the value of a. We 



start in Section |3.lj by explaining the simpler case when a G 
of Theorem [5] by considering the final case in Section 3.2 



0, 



■y/3-1 



and then complete the proof 





F\ Rif H I I 

Figure 3: Depiction of the bipartite graph associated to a bi-point solution. The fat edges are the 
clients that form the edges of the stars. For clarity, we only depicted one client j that is not part 
of a star. Client j has distances d{j,i\) = di(j), d(j, 12) = ^2 and d(j,is) < 2d2(j) +d\(j). 



3.1 Algorithm for a G ^0, y \^- 

The idea behind our algorithm is that when a is small then we can open most facilities in Ti. We 
shall do so by starting with the trivial solution T\ that we will improve by almost greedily selecting 
stars and open all their leaves while closing their centers. 

As we will maintain the property that i is open if not all facilities in Si are open, we have 
that the connection cost of a client j is c^O) if ^(j) is open and at most d±(j) + 2d2(j) otherwise. 
Consider the trivial solution where we open all facilities in T\. Then the total connection cost is 
upper-bounded by X^j(^i(j) + 2^2(7)). If we open the facilities in Si instead of i this will save us 
the cost Ylj£6(Si)(di{3) + <h(J))) where S(Si) denotes the clients that are incident to the facilities 
in Si. This motivates the following linear program that maximizes the cost we will save compared 
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to the trivial solution: 



max 



£ £ K(j)+d 2 (j)) 



subject to 



ie.Fi je<5(S;) 



2 ^d^l " 1) < * - l^i 



ie.Fi 



< Xi < 1, Vi G J"i 



Intuitively, Xj takes value 1 if we open all the facilities in Si and if we open i. If Xj = 1, we 
need to open \Si\ — 1 more facilities (i.e, close i and open all facilities in Si). Thus, the constraint 
says that we can only open k facilities. Note that this is a Knapsack LP and hence it is easy to 
see that an optimal solution has at most one fractional variable. Furthermore, %{ = b is a feasible 
solution since b \J~2\ — b [J 7 ^ = k — [J 7 ^. Therefore, the optimal solution to the LP has value at least 
b{d 1 + d 2 ). 

Consider an optimal solution to the Knapsack LP with at most 1 fractional variable. Then, we 
open all the facilities in Si with Xi = 1, all the facilities i £ Ji with Xi = 0, and for the i with 
fractional xi we open i and facilities in Si uniformly at random. (This step can easily be 

derandomized by greedily selecting the [a^Sil] facilities in Si that maximizes the reduced cost.) 

Note that we opened the facilities so that the (expected) saved cost compared to the trivial 
solution is at least the value of the optimal solution to the linear program. Therefore, this gives us 
a solution of (expected) cost at most 2d,2 + d\ — b(d2 + d\) = (1 + a)d2 + ad\. Also, the solution 
opens at most k + 2 facilities, where the additive term 2 comes from the star Si with fractional x,- L 
value. 

Since we can assume that d2 < d\ (otherwise we can simply open all facilities in ~F\), the 
algorithm has an approximation guarantee of 



In this subsection, we give the algorithm for the most complex case. To simplify the arguments, 
we give a randomized algorithm that can easily be derandomized using the standard method of 
conditional probabilities. The idea is that we wish to describe a randomized rounding that opens 
a facility in T\ with probability « a and a facility in Ti with probability ps b and at the same time 
ensuring that there always is an open facility "close" to a client by maintaining the property: if i 
is not open then all facilities in Si are open for all stars. 

We now describe such a randomized rounding that takes a parameter rj > that balances the 
achieved approximation guarantee with the amount of additional facilities we open: the achieved 
approximation ratio is (1+ry) 1+ 2 V ^ while we open at most k+0(\/rj) facilities. It shall be convenient 
to distinguish between large and small stars. We say that a star Si is large if \Si\ > 2/{abrj) and 



(1 + a)d 2 + ad\ 



< (l + 2a), 



(1 — a)c?2 + dd\ 



which is at most if a < V^i . 



3.2 Algorithm for a G 
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small otherwise. Moreover, we partition the small stars into \2/{abrj)~\ groups according to their 
sizes: 

U h = {i G Fx : \Si\ = h} for h = 0, 1, . . . , \2/(abr))] - 1. 

The randomized algorithm can now be described as follows: 

1: For each large star S^. open i and open [Kl^l — 1)J facilities in Si uniformly at random. 

2: For each group Uh of small stars: take a random permutation of the stars in Uh, open the 
centers of the first [a|ZY/j|] + 1 stars, and open all leaves of the remaining stars. In addition, if 
we let L be the number of already opened leaves subtracted from bh\Uh\, then with probability 
\L] — L open [LJ and with remaining probability open \L] randomly picked leaves in the first 
[a|%|] + 1 stars. 

Note that for a large star the algorithm always opens its center and (almost) a b fraction of its 
leaves. For a group Uh of small stars, note that we open either the center (with probability at least 
a) or all leaves of a star. Moreover, we open the additional leaves so that in expectation exactly a 
b fraction of the leaves of the stars in Uh are opened. 

We start by showing that the algorithm does not open too many facilities; we then continue by 
bounding the expected cost of the obtained solution. 

Claim 12. The algorithm opens at most k + 3 \2/(abr/)~\ facilities. 
Proof of Claim. Recall that we have that a \ J~i\ + b | J^l = k and therefore 

J2(a + b\Si\) = k. (3) 

ie.Fi 

First, consider a large star i G F\, i.e., a\Si\ > 1 / (677) > 1/r/. For such a star, the algorithm 
opens 1 + LKI'S'il ~~ 1)J 5; 1 + b(\Si\ — 1) = a + b\Si\ facilities, which is the contribution of star i 
to (§. 

Second, consider a group Uh of small stars and let m := \Uh\- When considering this group, the 
algorithm opens [am] + 1 < am + 2 facilities in T\ , and at most 

(m — \am\ — l)h + \bhm — (m — [am] — l)/i] < bhm + 1 

facilities in Ti. Thus, the total number of facilities open from the group Uh of small stars is at 
most m(a + bh) + 3. Asm is the size of Uh and a + bh is the contribution of each star in Uh to 
the statement follows from that we have at most [2/(0677)] groups. □ 

We proceed by bounding the expected cost of the obtained solution. The intuition behind the 
following claim is that we have designed a randomized algorithm that opens a facility in Ti with 
probability ~ b and a facility in T\ with probability ~ a. Therefore, if we connect a client j to 
*2 0) with connection cost d>2(j) if that facility is open, to with connection cost d\{j) if that 
facility but not 12(3) is open, and to the center i of the star Si : ^(j) £ S% with connection cost at 
most 2^2 (j) + di(J) if neither or iiij) are opened (recall that i is open if not all facilities in 
S{ are open) , then the expected connection cost of client j is at most 

b ■ d 2 (j) + (1 - b)a ■ di(i) + ab(2d 2 (j) + di(i)) = adiU) + b (l + 2a)d 2 (j). 
The following claim then follows by linearity of expectation. 
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Claim 13. The algorithm returns a solution with expected cost at most 

(1 + rj) (odi + 6(1 + 2a)d 2 ) . 

Proof of Claim. Focus on a client j with i\{j) = i\ and i 2 (j) = h as depicted in Figure [3j Let 
i% = 7r(i 2 ) be the closest facility in T\ to i 2 , i.e., 13 is the center of the star Si 3 with i 2 £ S*j 3 . Notice 
that d(i%,i 2 ) < d(ii,i 2 ) < d\(j) + cfoC?) by the definition of it. Thus, d(j, is) < d 2 (j) + d(i3,i 2 ) < 
d\(j) + 2d 2 (j). We connect j to i 2 , if i 2 is open; otherwise, we connect j to i\ if is open. We 
connect j to 13 if both ii and i 2 are not open. (Notice that for a star £«, if i is not open, then all 
facilities in Si are open. Thus, either i 2 or i% is open.) Connecting j to the nearest open facility 
can only give smaller connection cost. By abusing notations we let i% (i 2 , resp.) denote the event 
that i\ (i 2 , resp.) is open and i\ (i 2 , resp.) denote the event that i± (i 2 , resp.) is not open. Then, 
we can upper bound the expected connection cost of j by 

Pr[i 2 ] ■ d 2 (j) + Pr [iii 2 ] • di(j) + Pr [hh] • (2d 2 (j) + Mj)), 

which, by substituting Pr [iii 2 ] = 1 — Prfe] — P r [^1*2] ; equals 

(2 - Pr [i 2 ] - 2Pr [iii 2 ]) d 2 (j) + (1 - Pr [i 2 ]) d^j). (4) 

We upper bound this expression by analyzing these probabilities. 

. If i 2 € (i.e., ii = ^3) then ii is always open if i 2 is closed and 
i 2 ] . If S^ is a large star, then ii is always open and we also have 
Pr \iii 2 ~\ = Pr \i 2 ~\ . In both cases, we have Pr [iii 2 ] = 1 — Pr[i2]. 

We now consider the case where S^ is a small star in a group Uh with m := \Uh\ and ii / ^3. 
Note that if Si 3 is either a large star or a small star not in Uh then the events ii and i 2 are 
independent. We have thus in this case that 

Pr [iii 2 ] = Pr[ii] • (1 - Pr[i 2 ]) 

= M±l.(l_P r[i3 ]) 
m 

It remains to consider the case when Si 3 is a star in Uh- Notice that the dependence between i\ 
and i2 comes from that if i 2 is closed then i^ is opened. Therefore, we have 

Pr [hi 2 ] =Pr [ii|i 2 ] .(1-Pr[i 2 ]) 
\am\ +1 — 1 



Let us start with Pr [iii 2 
thus we have Pr [iii 2 ] = Pr 



m 



(1-PrN). 



We have thus showed that Pr [iii 2 ] is always at least a ■ (1 — Pr [i 2 ]). Substituting in this bound 
in Q allows us to upper bound the connection cost of j by 

(2b + (2a - 1) Pr [i 2 ]) d 2 (j) + (1 - Pr [i 2 ]) d x (j). 

We proceed by analyzing Pr [i 2 ]. On the one hand, if i 2 is a leaf of some big star Si with s = \S{\ > 
2 /(barf) then Pr[i 2 ] = L fe ( g j 1 )J j s greater than b — 2/s >b(l — arf) and smaller than b. On the other 
hand, if i 2 is a leaf of a small star Si in group Uh with m := \Uh\ then in expectation we open 
exactly a 6 fraction of the leaves so Pr [i 2 ] = b. We have thus that 6(1 — arf) < Pr [i 2 ] < b. Since 
(1 + 77) • (1 — arf) > 1 we have that the expected connection cost of facility j is at most (1 + rf) times 

(26 + (2a - l)b)d 2 (j) + (1 - b)d 1 (j) = 6(1 + 2a)d 2 (j) + ad x (j). 

The claim now follows by summing up the expected connection cost of all clients. □ 
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We complete the analysis by balancing the solution obtained by running our algorithm with the 
trivial solution of cost d\ that opens all facilities in T\. 

Claim 14. We have that min {di, adi + 6(1 + 2a)d 2 } < kt ^-{ad 1 + bd 2 ). 

Proof of Claim. We change d\ and d 2 slightly so that ad\+bd 2 does not change. Apply the operation 
to the direction that increases the left-hand-side of the inequality. This operation can be applied 
until one of the 3 conditions is true: (1) d\ = 0; (2) d 2 = or (3) d\ = ad\ + 6(1 + 2a)d 2 . 

For the first two cases, the inequality holds. In the third case, we have d\ = (1 + 2a)d 2 . Then 

ad±+bd 2 = a(i+2n)+i-a = i+2a? 1 max i mum value of the quantity is 1+ 2 V ^ , achieved when 



V3-1 



□ 



We have shown that, by letting 77 = e/(l + Vs), we can efficiently obtain a 0(l/e)-additive 
1 +^+ e -approximation to a bi-point solution with constant a and 6, which proves Theorem [5 when 

'1/3-1 2 



a G 



1+V3 



4 Discussion 



We have given a 1 + \/3 + e-approximation algorithm for fc-median, improving upon the previous 
best 3+ e-approximation algorithm. Besides the improved approximation guarantee, we believe that 
the most interesting technical contribution is Theorem |4j namely that we can approximate k in 
/c-median without loss of generality. More specifically, any pseudo-approximation algorithm which 
outputs a solution that opens k + 0(1) facilities can be turned into an approximation algorithm 
with essentially the same approximation guarantee but that only opens k facilities. 

For A;-median this new point of view has the potential to overcome a known barrier for obtaining 
an approximation algorithm that matches the 1 + 2/e hardness of approximation result: the lower 
bound of 2 on the integrality gap of the natural LP for fe-median. In particular, the known 
instances that give the integrality gap of 2 vanish if we allow k + 1 open facilities in the integral 
solution. Following our work, we therefore find it important to further understand the following 
open question: what is the maximum ratio between the cost of the optimum solution with k + 0(1) 
open facilities, and the value of the LP with k open facilities? One can note that the hardness of 
approximation reduction in [11] implies that the integrality gap is at least 1 + 2/e even if we open 
k + o{k) facilities. Moreover our 0(l/e)-additive approximation for bi-point solutions achieving a 
guarantee of 1+ ^ +<; < 1 + 2/e shows that the worst case integrality gap instances are not of this 
type when pseudo-approximation is allowed. 

Finally, we would like to mention that Theorem [4] naturally motivates the question if other hard 
constraints can be relaxed to soft constraints with a "violation-dependent" increase in the runtime. 
Soft constraints often greatly help when designing algorithms. For example, the capacitated versions 
of facility location and k- median are notorious problems when the capacities are hard constraints but 
better approximation algorithms are known if the capacities are allowed to be slightly violated (see 
e.g. [8]). As our approach was inspired by studying the power of the Sherali- Adams hierarchy |21j 
for the A;-median problem, we believe that a promising research direction is to understand the power 
of that hierarchy and the stronger Lasserre hierarchy [Tj>] when applied to these kinds of problems. 
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